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This paper describes the implementation of a typical data-base 
manager for an A.I. language llfce Planner^ Conniver, or QA4 t and 
some propped extensions for applications involving greater quantities 
of data than usual. The extensions are concerned with data bases 
involving several active and potentially active sub-data-bases* or 
"contexts". The major mechanisms discussed are the use of contexts 
as packets of data with free variables; and indexing data according 
tu the contexts they appear in. The paper also defends the Planner 
approach to data representations against seme more recent proposals. 
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V SO UHAT? 


[ rNTmoucriiw 


One of tho durab»B ideas In ft] ia the p a 11 erp-orr on ted data baea 
developed ty Carl Heultt far the original Planner programming language. 
<Heul 11 , 1371 > ft has appea.ed to many people because it geema to be a 
good framework for imp Igqenting a "blackboard - eommunication channel 
bctHomn processes. <NeucN, 19G2> 

Suph a d 5 ta base operates by indexing records 30 that they may jjs 
■ accessible with any of several different key patterns. This Is the earn a as 
the more traditional "file IrtvSraion" <ftivest, 1974>, ewcept that the 
patterns ar® more couple*. The data daae also provides primitives for 
copying and combining collections of data, these facilities can be used to 
implement ccntroPled “deductive 1 procedures which limit their searches to 
formulas that have been included in the current local data base, and which 
ar® like y to match the current goal a. When such data has a a were 
(mp I chanted (by various people in different nays. e.g. f dRul i fson et. aT.* 
J97Z>i. <Sueaman and HcOdrmott* , they were found to help overcome the 

obstacles facing previous deductive programs. 

However, recently 6 uch data bases have cone in for aone firlticiam. 

Sone critics feel tnat set! of pattern-accessible records are wrong (or 
fliaguidtd) 35 a data structure for Al applications. They usually recommend 
some more strongly: organized structures they gftfcn appeal to the concept Of 
"frame." <H leaky, 197A> 

This CriticisTi 5 OUttr eased by the fesiing among nary people that 
Planner-type data base* are Intrinsically hard to organise e-wccpt for email 
"toy" prod lens. Scott Fahlnan c2375> has argued that the usual methods Of 
organising them will break down completely when we try to implement data 
basco of the size the human brain nuet contain. 


Although it is possible, and done ail the tine, to misoryanlie a- 
Planner data base, T fed that the critics are being unfair* Jn fad* > t 
ec«ms as though the frame i flea f I awe naturally fron the nature of PlafVfis-r- 
type data c a eb me chan ism, krfeC lud i fug the "content" nechanism of QA4 and 
Conniva-^b lr< par titular, the net ions of hypotheg i a .-dr i van recogn! t ion 
triggered by suggestion denon* <Fah>man T 1973=-, of default values in frame 
Blots, enplaning discrepancies, frame transit ion, etc. P seem to oa 
descended fron ideas tike consequent vs. antecedent reasoning xHewitt, 
l37l> H making asaufiptiens based on partial evidence <ndh;rtio11, 1374^3-, &rnd! 
Debugging a lightly urgng data bases ^Suasnan, 1975>. As far as I can toil, 
a frame is just a particular nay to use a Conniver context* 

Host frane proposals, houever, do hdt Suggest further ambelI■shments of 
an a I ready sound framework, but instead tend to propose archaic data 
structures with lists e-f slots and values, or a snail Set Of interfaces and 
Onssibie messages. The only reason for this seems to Ce a u Ida spread 
acceptance of the second criticism, that a large PI goner-type data base 
could net be i np lamented efficient ly+ By "large' i mean iarge both 
physican ly, requiring secondary a terage h ith tcday h s techno Iegy^ and 
organ!aatIcnaity, requiring care to avoid useless computation. It is the 
purpose of this paper to show that the case Is unproven. Bee Idas having 
this narrow technical intention, I intend to say a Nttls about Mhal Kinds 
of operations ought tc be efficient In an AE data base, and which proposed 
data strueturn might be the besti 


31 A TYPICAL DATA BASE MANAGE* 


In this g^ciicn, I Mill describe the i up I errant at ion of a typical 
Planner-Mka data Base manager* ( 7 D 3 MK The description i 4 drawn from my 
experience ui th several versions at Conniver <MdD«rHott and Susaman, 1973 >, 
Out Is somewhat Ideal rzed + so as to die tract you from merely naphajard 
fault® df existing systems and Shew you their deeper prefclenB. 

IK A Behavior 

There are three important features of a typical data base management 
System: "partial-match" record retrieval, multiple data Oases, and derived 
data. 


I KA*i "Part I 3 i-natch" Record fletrifcvaf 

Any record nay Be accassed by specifying any subset Of its components. 
With, each specified conpgnent in its proper position* The unspecified, or 
'’don, 1 t cars,” parts may be indicated by question marks* ror example, the 
record [P If al> may Be referred to by Keys (P 7 ], f? (? a )l + fP (? &}} + 

ftecords in the data base, called ftemi, are indox&d when they are put 
there taddetf:i| and cm indexed when thay are rstfOvecf. Th e ratrievai function 
is FETCH, Which returns ail the present 1 tens which match Its pattern 
argument* for axanple, if (P (f aM E (P (f bl ) , and {P cl are present. 


(FETCH 1 <P tf a)1} returns HP (f a)}} 

(FETCH 'IP U 01 )J returns 11 

(FETCH 1 (P if ?m return* f(P (f aU K (P If bill 

(FETCH iP 71) returns HP Ef aj ] h [p (f t?))» (P eJI 

(A, nO'l* OP notaton! as in flACUSP <neon, 1374;>, M + S-expressfan" is an 

abbreviation fcf " [QUOTE S-expressfoni,“} 

SfjAiet . ] *i*an by "natch" the! (here i* gome mapping* from 
Occurrences of "?' in the fetch pattern to S-e«:pre**ion9, which dak-ss the 
fetch pattern equal to th-fe item retrieved, [n practical systems* the 
matcher is sore complicated. It may be thought of as a uni fi cal ion 
algorithm <Robin*pn* 19B5v, in which labelled “don't cares" play the role 
of variables. Par exaspip* {FETCH ‘ID 7X 7K1) should Find. (C a a) but not 
H) a 0 ) , I ulll more or l&$$ ignore this- kind of matching in this paper* 

However* there is an associated peculiarity I dust mention. Dur 70BH 
bust a I leu "don't cares" in the i ted records. This is becaui* Users 4 -rill 
want to be able to model propositions with items, and need to have fren 
(universal I y quantified) variables in i tens like £1)5 MAN ?X) p {MORTAL 
7K3). For our purposes, this may be treated as ({]£ m 7) □ [HGHTAL 7)]* 
Ue want to be able to find it with a fetch pattern like (? 5 (AORTAL 
FRED)). 

Ii-A.2 Multiple 3ais Bases 

in Af applications, ther* are Eaveral reasons Id hflva many different 
date basGE tn select from at a given time, However, these cannot be 
represented by anything as clumsy as seoarats files pr indexes. It has tp 
be cheap to copy an entire data base irt ouch a way that routine changes to 
the copy (additions ard roHqvals of data! ds not affect the original 

This is done hy repressnting a data ba?£ as 3 list cf layers which 
represent the history of changes to an original OHpty cut* Uith each layer 


are stored the deferences between the data bases that contain It and those 
that do not* Such a list Of layers is called a CXt {pronounced lima 
“context, “ Ccnniver’a misleading torn), 1 Mill reserve the term "data 
bass" from rtew on to mean tbs set of all data in all exts. 

For cw&np|c, a problem solver nay rode! tin® aa a sequence of extss, 
each with ode care layer. ]n CXT1, It may have \hn items ION A BS, [ON 0 
TABLE), (DM C TA9LE) , re erasartting the scene of Fig. 


IM _ 
IB; 1EL 

CXT1 - U) 
(a) 


__ I'M 

IBI L£J 

CKT2 - <Z 1> 

(d) 


Figure !1.1 

in con temp j at mg an action, the problem sclver may tiakr $ ireu layer [number 
21 f and "copy" CUT! by poshing the new layer onto it, making CXT2. Mow If 


it REnDVts CCM A B) and ATiOs [ON A C) with respect to- CKT2, all other flat a 
are undisturbed, and CXTl'e contents are unchanged:. The two changes 
remove ION A B) M and 'add {(fl A Cl" are associated with layer 2+ and thus 
with CX;2 and all further c«te derived fron it, {Another use is to model 
"hypothetical'' worlds in which some aosunption is assured temporarily. For 
more detail- on implementation, see ^ncOerpott and $Ufl*»an T 1973>.) If the 
layers of two pxts Cl and C2 are such that all the layers of Cl are in C2, 
Cl is nailed a SEiper-ext of C2. which is a sufr-cjrt of Cl, In my example, 
CXT1 is a super-ext 0 f C!K"2. 


Layers are well behavEd enough so that new cstts nay be formed' by taking 
the union of sets of them, (This fact is often overlooked in discussions 
of CMts, so that a 'ext .tree’ created Cy adding new layers is taken for 
granted,J f will deal with this feature at length in Seot► IV, 





EE.A.o Derived Data 


Thi structure } have describecT it i nterrva I | yi complete* but ] must raise 
ene hare superstructure upon it to fit it into the usual tradition, 

Planner-type data bases a I neat aluayi have built-in mechanisms for calling 
simple "deductive" procedures. One kind, the JF-NEEDED method or 
consequent theorem, is supposed to work with FETCH to generate "virtual 
itens" which might not be thera until the FETCH happens, t mi I I not sau 
much about this type, since the issues »n implementing It are fcostiy 
centered on Hpu tp control a generator co-routine. 

The Other kinoa are implemented as software "interrupts" of additions 
and removals of items matching a pattern. These are Connived a if-ADOED 

and [F-REfKlVEQ method types. For eKenpIs, in pidgin LISP for the IDBtl* we 
ttigfit urite 

11 f-added (i e rose ?r) 

(add f" (color tfr red)} } 

[The ! "-notation Indicates "(Hja0i-qugtaticin. H E the value of !" $ -express Ion 
is S-expression with subparta narked by V replaced by their values.( ft 
procedure like this is present in a c*t just like the Etens it interacts 
with. ' (ADC ite ml tails (FETCH f p 1IF-APDED afte.ro ?H r and evaluates the 
third a let in the returned Itema. 

These interrupt methods tend to be used in tuo ways; to implement 
useful forward deductions* gnd as genuine interrupts repuirin^ non- trivial 
attention by the calling Program* The turner is i I lusfratfriJ by my "rosea 
are red" example; the latter, uy a method in a problem solver that notices 
When a "pretested" goal ia being clobbered, {Cf. *Suseman ( 1375>J Tha 
latter, the "true interrupts," pose n$ problem from a Data-base management 
point -qf view. £ Mill ignore them. 


The others, the "data oar-i vers," fall into two classes* 1 f-gddeds that 
add consequences of their trigger items; and i f-removedB that clean them 
up, like; 

(H-re*l*vud (is rose 7H 

Hrenova !" [color dr red)! 1 

] Mill idealize this situation by assuming one kind of data deriver, an 
Item of the form 

{ AN7EC-[ TEH f npo t Qtil put). 

(‘*Antec' , ‘ is descended fron Hewitt's <1371> "antecedent theorem.**1 For 
eh asp I e, ue can have C ANTEC-E TEH (IS ROSE 7R>. [COLOR 7F RED) I- When ADD 
adds an I tea matching fopwt, the matching substitution is applied to 
Output, and adds I t, too. Rsmbval works dsynnsstr 'wl'U- In thro scheme, 
ADD makes a net* that the support t pc, ggy, jCOLOR RBSEtf71 fl£0) In this ext 
Is the presence of OS ROSE RQ5Efl71> and HNTEE-ITEM US ROSE 7RI [COLOR ?R 
REDD. This note Is attached to all three items, Nben REMOVE flushes an 
Itch, it flushes as us I I all items supported by It that have no other 
support. This obviates elean-gp Methods. 

The detail* Of this somewhat arbitrary octieae arE unimportant, but the 
notion ef di at itigui sh i ng true- interrupts from data deriver 9 has been shoun 
to be useful in practice, 

1].E t npIesehtation 

[fi the TC3M, there are three entities that stand bit tween the caller of 
FETCH and the datat the indev, the e*t filter, and the rrateher, All 
previously-mentioned data are in thE Index fuhich behaves much like the 
LtaP atonic symbol array I, but a given call to FETCH .should return only a 
handfuI, (See Fig., E] .2.) 
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Figure 13*2 

The last Step Irt F : 'g. 11.2* which rejects (P a b) when the fetch pattern is 
(P ??t ?n) „ is not of Interest to us, l+e focua on the others* 

Notice first that Otte are implemented backwards* Th* iyetert does not 
look ir a cht far a datura: it leaks an a da tun to see if it is in the 
current c«li In the indgjt, an i een ifi Stored aa an ftem datiirr, of the fort* 
[ftCJ? -Cnrarkers-I, The Cmarkers* one per relevant layer, gay which evta 
the I tem has been- added to Or renovad from. *sHenerhott and Susaman, 1973> 
These markers are compared with the Current c>rt {the two lists being kept 
aerted py layer nunoer) to sec if the datum la actually there* (Datum 
"proper t i es, 11 such as what data support a eaturn, may a Isa be stored in the 
cmarkfrrs.) 

,' he marker Systen treats datum presence aaynmetricalfy from absence. A 
marker- for a eM 'aycr is attached te a datum only if the datum was added 
w.th respect to the I flyer. If :t is later removed from the layer, the 
Harher is just throun away., however, if it is removed In a ekt derived by 
pushing new layers onto the original une, the system must GearsT the 
marker* For example* If rQN ft 0) jg narked as being in layer 1, then 
REMQVmg it In c*t (2 2 1] narka l£N A 6) "present in 1 aa cancel led in 3+ * 
Any Z*t which includes layer I 4-ii 11 contain (3N A 9) unless it includes 















layer 3 also. 

Ihis 'backwarda" cut scheme i a efficient enough for email data bases, 
but far large ones It COuld be embarrassing, This is the topic Section 
IV, 

Even* for small data taiSa, it is inparfant that tbs index-f etcher 
reject ail Cut a snaJ | Net of candidates. {Hatching tends to be an 
expensive uay to reject an iten.l There have been many ways suggestti-tf to 
do thie, The following ia a composite., 

Every itel* is indexed hy its feature-S, A feature |* a pair <atcnir, 
position*, For sxanple, [P If ?)} has features <P, CAR>, ci, CAAQFlx, <?, 
i,ftDADR>. ■, The positions are. of course, -‘eelty represented: a 3 oil strings, 
but we uiM LISP CAR-CCfi ccnpog I 1 1 ens, with 'CR' standing for the empty 
string that repr*Bsnte the top-level portion.} For each feature, there is 
a bucket of l tens with that feature, {This nay Os implemented using a hash 
tab 11 +) For example, the <P, CAP* bucket wil] include items [P a), (P [f 
b) J , [P m h b p) , i f these- are present in any an the data base. The 
<?, CAPR* bucket will contain i tuns like iP ?x3 , [fQ el ?y|, {R ?y [f C H . 
etc. 

The function itiOEX-FETCH takes P pattern and a position, and returns a 
liat of candidates that contain. In that position, a structure which could 
match patten. Thus ([NCEX-PETDH 1 .pattern 'CR> returns ail i tans anywhere 
which could match pattern, [f the pattern jg "?," all items suit bp 
included; this is indicated by returning a symbol for the h un. i ver sa 1 
bucket," Otherwise, the rg turned list must always include the members Of 
the <? t position* bucket, [f the pattern is an atom, tbs union of the <?, 
poif E io/fS' and *-Atom, posit ipj?> buckets is returned, loathing else could 
no Ich.) 

When tbs pattern Is hfib-*to«nic f IfCD<-FIICH must examine the left and 


right "e* tens i ons" of the- current positions that la, thE tAfl and CDR 
relative to the current pdiitlwn* For example, C(NEEX-FETCH ' tP ?] ’CRT 
hMI call (INDEX-FETCH ‘-F- 'CAR! and (INDEX-FETCH + i>] f CQRt). Tha pattern, 
of cell a may bo represented by b tree Laomorphlc to the original latch 
pattern! 


*P, 


<?, Cfl> 



Uhl (VERBAL JI1L, EUCfl* 

BUCKET <?, Dm 



(el Lii) 

Figure St.3 

Remember that at each "care" node the "oon 1 t CBT«" bucket must he unioned 
in, But f ■ ret, what does thE system cto with the results of tho tuo 


subedits to iNHEX-FETCH at each levels Conceptually, the desired result »o 
their Intersectior, the set of all iitena which share the feature 
combi-oat 1 dus on each subtree, (Thio is actually done in the current 
tohhivcr, unbar some c i reuwst antes.} However, it Is too painful in a large 
data hese with many liens, The output of the intersection may be quite 
email„ and it wifi have to be redone every tine there is a FETCH. 

A cheaper procedure, most Of thB tine, is just to take the shorter of 
thn two Duckets at each stage, This leaves in soma losers, but it may not 
be worth it to get rid of then. For example, given the fetch pattern (P 
AT + there may be many 5 tens starting with P T byt only a feu with an A in 
Ihs CADR position, r-qr ngu, let us assume it is cheaper to filter out the 
losers with the cwt and matching mnebamsns than do intersect Lons tilth 
arbitrarily long tuckets. 

The mechanism ao far thus selects a branch of a tree like Fig. II.3(al, 


and relurns th-e union of the buckets on the branch. {There la no reason 
actually to tfc the union until it return* f 0 the tap level,} Notice that, 
since the tree is searched depth-first, the search daun a branch can be cut 
of f as spun as the Suh of the sij*i of the buckets encountered e-Kceeda that 
Of the host previous branch. 

an a very large da‘a base, taking the shorter of tho two Competing 

bucket uni one tiay not d( enough. If there are tun competing features in a 

fetch pattern and each has a la-go bucket, hut the i n tersect tan of the fug 
is snail) it la costly to return either one or their {expensive to cuinc 1 riJ 
inteeeetion* Tho usual so I ut ten le to tine one of the offending buckets 
and break it down by features again, te produce corresponding 

to pairs of features. This amounts to Computing all non-tbpty 
Intersectipns rn advance. Fpr exanple, if the indexed i tehS arc t(P a-, |p 
tsl, {R c) r (P dl« [p el, EQ al, IR a), ES 0 }„ {T alf. then the «R, CAR> 

bucket U HP at, IP sl t [R c ). IP d) t (P «)}, the <a, CdDR* tucket He HP 

a}» IQ b I , [R 3 I , {£ aJ t (T aTT T and the <NH,t CQDH> bucket includes pFI 

the data. Tne best tUsOEX-RETCH ’ IP AI f CP] pan do ia find the two S-l'tO* 
bycketa, fisEuning 5 exceeds the threshold of bad taste for bucket size, 
the indexer decides to break down the cF, CAP* bucket. 

When it is cone, tha Inoax contains the following: 


Backets 
xP, CAFt> 

Swb-bi/cfce ts 

<P t CAR* HP a], [P b), (P c), [P d>, fP nij 
<a, CAOR> UP aff 
xb, C*DR> UR bf! 

<c, C'ftDR> HP cH 
<d, £AOFr> HP d)I 
«a, CADFi > HP *}] 

<HJL tOOR> HP a] , CP a), (P C J„ (p o) „ (P sit 
<Q, CAK* ECd alf 
<R, CAR> HP alt 
*S r CAR? HS si \ 

<T + cap> nr an 

<a t CA£fl> HP an [Q a), tft al, [S al, (T aH 
<6, CAUR* MP b>) 

<c, CA3R> EIP e)l 
<d, CAOR> E(P d> F 

<a, :adr> iep *u 

«N1L P C\m> C IP at, (F b>. CP c?, IP dl. CP a), CQ at, 

CP at, fS at p CT aU 

Figure ][.4 

Thia procedure is called rthAshing the xP, CAR?- bucket, From now on, any 
reference to the <P, CAR* bucket <6 handled by finding the re levant sutj- 
bUCikCte given the rest of the pattern, and using these inutsad Earcd 
ignoring ths corresponding buckets in the main index, which are a I -lays 
bigger}. Consequent ly t bucket and sub-bucket keyg must be genetsi i zed to 
sequence a of features, like «P, CAR*, <a, CAtfU>, In thia way, tho index 
tends to organize itself Into efficient sw-indexes, If necessary, tbs 
process *ay continue, generating sub-auh-indues, etc. (with the sub-incew 
ccbi i ny irure and mo* - # to resemble QA4 1 s d i acri ■ i nat ion net index system 
<Rullfaon fit. e7 Ti 1972*). 

One drawback to my scheme S» that aone buckets created this way hi I f 
never be used. For example. In the temple index shown, the onEy reason to 
have a bucket for the feature *a, CAtPb |n the main index at well 3s in 
various aub-Indexes is the possibility of a future (FETCH 1 (? al) + If tho 
user is aure 1g sure at I future fetches ulH mention P, Q. R, etc + 
explicitly, tha bucket i a useless |* him. 


The solution 13 to sate the searching, indexing, and un Inde*ing 
routines data-dri re,7 in the sense that their default actions ngy be 
overridden by routines dependent an the data being handled. Th* simplest 
scheme Is to associate special indexing routines uith the CAR of an Item* 
For exanple* the routines for a tame I i ke P might specify skipping Indexing 
and fetching on the CflDH position at ail, in the caae uhe-’-e gl[ FETCHes 
have a in the CADFl position. 

Data-driven indexing is helpful in other uayg. A form Mke QA4’ $ (SET 
eIementl ... gleirantZi may be indexed In ouch a uay that ail the elements 
Ore associated with the satie [CfiD3) positron. Then a special set matcher 
can be used to process FETCHed sets, [f the set {SET A 31 i B fished for 
uith tha pattern (SET 3 . ?) , it Mill fie overlooked unless something like 
this is done* A more prosaic example is avoiding looking for <?, pos> 
buckets When they are known not to exist. Ordinary PFanner and Conniver 
programs usually have great Quantities of dull data Mke (VERTEX 5,5 -4*7}, 
Where it is waateful to have to look for the enpty <?, CACFl> and <?, CADDR> 
buckets. 


Ill THE SYMBOL-MAPPING PROBLEM 
III.A Introduction 


Difficulties of scale with the typicaF date-base nanager appear in 
connection with Fahlrsan’s <157$s. ' gymoa I-rnaopi ng" crcfilen, 

Suppose l tell you that 3 certain animal — let's can hilt elude— re 
3n eleohant. You accept this simple assertion and file it away uith no 
□□parent display ef hcntal effort. And yet, as a — gulf of this simple 
transaction, you suadenly gpoeir to know a great deal about Clyde* If I 
say that Clyde climbs trees or plays the piano or lives in a teacup, 
you «l I I imnediately begin to d 0 ypt ny eredipiIity. Somehow, 


*elephgnt" is serving a? wor* than a here lace! heret it is, in some 
sense t a whole oackage of properties and relat ionsh i ps, and tha,t 
package can de delivered by means of a single [S-A itateumt. 

in pr;ncrple t such behavior eon bo achieved through the use of soma 
form of deduction. Each fact is a seoarate entity, and fl-ou facte 3 r B 
produced by knocking together two old, ones. Thus, if «e have "All 
elephants hav* ur inkles' 1 and "Clyde i* an elephant", ue have the right 
to oeduce that Clyde has wrinkles, tn pro form dr another, this has 
been the standard A] apBroach, 

Byt having the right to deduce soma fact ie not the same as having 
the job dene. HuCh ingenuity has been devoted to the search for fast 
deductive mechanian?, pyt ths problen renalns intractably 
comdinator 'a I, 

There are tuo problems described hares bringing rh large amounts of 
typ i cS }— e I ephant itnou I e^ye, with, loosely speaking, tuples eleohant "bound 
to" Clyde! and detecting imp IauaibiUties t"CIgde cIimps trot®*) w hen they 
come up, 

The second is a serious problem, whose solution depends on the 

i ' 

particular dona in it l? drawn fron. We should not expect a raw data-base 
systom to perform unassisted task? like choosing betueen tub 
interpretations df "h*“ in, "The monkey screened at the elephant as Ihe 
climbed the troe.“ 

But ) fool that the first probien shoyld bs made sinpIe by such a 
eyetem. Let us examine the three options tor solving it that 3 Planner^ 
type system offers ust 
(1.) Ue COuid treat 
tis elephant ?«] 

3 {color ?x gray) a {size 7x pig) a [diet ?x plants) 
a tie namnal 7k) a , 

as a large AH TEC-I TEH (Sect, N.A.3J. When (IS ELEPHANT CLYDE) is added to 
a cNt. the new I y-censed facts (CDLCF CLYDE GRAY), tSiZE CLYLE Bid, *** are 
added as utlf. Unfortunate-ly, thifl ie likely tp uaste ? lot Of time and 
apace, Since the number of facts known about elephants is large, but only a 
feu of them arg going to be used on any occasion. Further, facts like {IS 
NAMMAL CLYDE) are likely tc cause nany morg facts to be deduced. 


i2\ U a could wait until a fetch of tCCLDR CLtfDE ?CI was at tested, and 
profiosc {IS ELEPHANT CLYOEI as a subyoat {via an EF~WEED£0 msthod or 
consequent theorem!. i cu^l y 4 however , there are many equally goad- 
loohing Skoals* like (IS ROSE CLYDE! or lOCCUPATION CLYGE tB [nNEY-SWEEP > , 
fer the ays ten to uaale its tine going through, 

Another approach like this is to search through all things that Clyde 
JS when hi S color is wanted, looking for an assertion like fCOLOR ELEPHANT 
CRftVJ, For this search to n*Sn anything, the IS reiatior ul I | have to 
carry the burden of representing host information in the syaten. indeed flfly 
piece of inf or oat ion from which inferences could have been made fin this 
ease* about COLOR)* For example, one would oe forced |g gay {IS CHtHNEY- 
SUEEP CLYDE! instead of {OCCUPATION CLYDE CHI HNEY-SWEEFJ t or the item 
{COLOR CH]ftNEY“Sk£EP BLACK! uoulcf not Ds found. Although the approach can 
be useful when the use Of [£ is car*fy||y tailored for a specific 
application, If used profligately It turns "I IS" Into a mere syntactic 
sysiSjoI , a left bracks! Ilk* ' which indicates only that son* 1 nf srenca 
might have Been araun from what it da limits. The method than becomes 
"search through LaltibstJ everything known about Clyde to see if a COLOR 
turns up," which could not be efficient on a noma) computer* |0f course, 
it is an open question whether all useful AI inference can be made to fit 
the ■{!£,.,) syntax. Cf* c|4oods t 1975>! Another difference batwaan "IIS* 
and M |" would be that IS presumably would signal that Its second argument 
is uhat the assertion is "about," For exsnpte* jf hy accident we had {IS 
(DCClFATlCfl CLYDE! lHEMNEY-SUEEPI, this fact would not be "about* Clyde, 
and would not be considered, 

(31 [f the rl^ht-nand side of the Implication could bs rep-esented as a 
evt layer or set Of ;aysrs, we could just merge these layers int© the 
current ext. The problen with this is that *CLYDE" does not appear on the 


right-hand aide, He ready want a 'closure' Of a cut* analogous to 
cJosvrea in language like POP -2 cBuratJlf at. al, t 1S71> ar>d LISP l.E 

cLavin fit, aI Pf 1B6£> 11 in which a free variable like ?X i* hound to CLYDE 
with minimal cost. 

Approach (31 ia the one we wi M explore, tut it i* worth examining in 
detail why a ConnivEr-aty I e ext ul M not work. The user can't replace 7X 
With something like THE-ELEFHANT. for two reasons: there night be acre than 
HJfio elephant? and he has to begin referring to Clyde a? THE-ELEPHANT. This 
second reason ie a problem because some of th* facta involved night r®for 
to ThE-HAtTIAL? and because two processes in the user's probleri abiding 
program might have trouble eenmunicating with each other* each hawing 

focuased on a different sspact of this creature and chosen a different 
name, 

Thses facts hurt, not Ortiy because they blunt owr attack on the eymbbU 
napping problem, but hecawss they appear to pulF the rug out from under 

efforts to implement a Mmskian frame ayatem tfliosky:* 1974> in a Planner- 
Jihe way. 

However, there Is a way to debug approach (3), which makes a Planner- 
type data base a viable candidate (or a substrate for frames* 

III.B Potential Item 

Thfl idea is to inoe* an item like (COLOR TO GRAY) «lth 7X aa a 
var "able, hut mark it as a mere pofcejjtia? ftem. Uhtjn FETCH finds such a 
potential item, it is to "snash" it with valued- of TO corresponding te 
creatures Whose elephant-hood have been asserted. [n addition* the Item Ie 
to Oe prtsgnt -onJy in exts including the ’eiepnant layer* N which is the 


rebre-a-efitait i on of the r ight--hand^s «ob of our I erg® imp I i cat ion. This- (ayer 
ie included only as I dog aa there ar* elephants around. Such a layar {or, 
more general I y, sntira cut), containing potent ial itens referi-lpg to 
eIaphant , is Called a packet, after FahSman <1973> and ttprcui tL374>. 

In Dther uards, [ an trying to make the system "cheat" 6y substituting 

values for variables In the right-hand side Of a large implication before 
the substitution to do it with is kneun. En lieu of that substitution, a 
set Of temporary bindings to dumny quantities like {^ABSTRACT ELEPHANT! 1 b 
used* Since the indexer is data-driven. It can he Instructed to lnde* 

[PASSTRACT , , + ] 3b a variable, 1 Mill indicate this hirtdi ng to a ouasi- 
eonatant with the prefix " ?ifft, “ 

Ae an exanple, let us ewanine the elephant packet, It contains the 
pokentlaJ items (C0L3R TftBELEPHANT GRAY) *, (SIZE 7tftfELEPHAfJ J LARGE]*. etc. 

■. j tj 1 I 1 nark potential i tens with an 1 ) The ays tee g-lveo the packet an 
atomic fiame t like ELEPHANT-PET, and treats i i as a predicate with the free 

variables aa argument*, {ELEPHANT-PIC? 7E1 . This allows it to process items 

like (ANTfiMTEn (IS ELEPHANT ?X) I ELEPHANT-PICT ?HJ), Uhen {IS ELEPHANT 
CLYOEI 19 gddsd to a cwt, it triggers the adding of [ELEPHAWT-PkT CLYDE). 
The system indexee this i ten as usual T hut notices that ELEPHANT-PKT is not 
an ordinary predicate, but a packs t t so it fflcltrdea the ELePHANT-PKT c *t 
layers in the current tKt* and notes the substitution of CLYDE for ?X. The 
Item [ELEPHANT-PKT CLYDEl is a packet-closure. 

Now uhen a FETCH of any of 

(color clyoe gray) 

(color fred gray] 

(color ?z gray) 

(color ?z ?c) 

(color clyds ?c), etc. 

is attempted, the potential i ten [COLOR TWfELEPHANT CRAY]* will te found 
and dCtueJfirfld to (COLOR CLYDE CRAY), This neu Item ulll appear in the 


local c*t, Of course, not in the elephant packet. 

It t[S ELEPHANT RALPH! is added, the syaterr adds IELEPHANT-PXT RALPH!* 
On thg nut tgtcn of (COLOP .*,). both (COLOF CLYDE GRAVJ and [COLOR RALPH 
GRAY) ui 11 be generated, [The the ays ten finds all bindings, of 
course, is to FETCH [ELEPHANT-Pia 7K] and use the retrieved swbet i tut 1 one. I 

Df courae, this it wasteful as It Stands, since the systen doesn't pot 
bep&ntl the potential l ten, ]n fact, it must replace it with its 
actual i 231 ions. 6 E f ny$[ be cautious enough to remember that It did that, 
bp it c3n reactivate It In case another elephant appears.) 

Finally, H d3t3-depenS*hCy notes" like those of Sect* li.A.3 *uat be 
attached to actualizations I lk 0 [COLOR CLYDE GRAY], ap that if (IS ELEPHANT 
CLYDE) Is removed from the ext, the atlua I izat ion a can be removed, too. 

And Lrhen thg iaat (iLEPHANT~PKT ...) packet-cl gsyre- |g gons, the layers of 
the ELEPKANT-PKT fnuat be excluded fron the current ext. 

III.C Further En&el|3shnents and Comment!) 

When a packet i s be i ng by| 11, it Is desirable to be able to Bake 
deductions trow 3 t& potential item* aa they are added. For example, It la 
Important to ho 3& I s to induce a closure of the rWIIIAL packet in ELEPHANT- 
PK7 tihen it is bui lt. So at this time the systen must treat 7#iHfELEPHAtJT as 
ati actual con at ant that can take part in matches, etc. IJhen (IS HA DUAL 
?#£ELE?HANTl is aeded, Lt is not treated as $ potential I ten, but causes 
S flAtlHAL-PKT TSfOELEPHANJ) to be included in the EUEPHANT-PKT ext* Thre ie 
not juat app I i cab He to t ran si live IS deduct! an? It ugrke far ordinary 
antecedent reasoning, and for inclusion Of Other kinds of packs to Esuch 4 b 
the TUSK packet , tuo closures pf which Might be included in the elephant 


packet). 


Thus there will be de-bencanc i es ahong the potential items. For 
example, (CAN ?##ELEPH*NI (HEDE-IN FtLENG-C ABORTS) )* might ae marked a< 3 
con sequence of (DDL® ?tf(!ELEPHAN7 GRAY)*, This ni l I net matter i, w eh unleaa 
mm user of the packet tries to remove the COLOR Hem for a particular 
a) apnant. This ta intuitively a corapU* operation, sines the reason* f or 
atid consequences of an alteration. to a structured data base may ha 
convolute^* However, sen* simple bookkeeping egn be dens by the data base 
manager. Uhoh (COLOR CLYDE GRAY) is removed, (CAN CLYDE tH3Dt-IN FILING- 
CABINETSt) should go, too* The situation may be represented as follows; 

ELEPHANT-PJ<T 

{color ?tftfelepnant gray)* —-> [cap Iephgnt 

Ihidb-in fiIing-cabI net a))* 

I 1 

i i 

| (a I ephant-^pke (eiephant-pM 

! Clyde) S Clyde} 

! 

CUEif)£Nr I I 

CXT v y 

(color Clyde gray] - - --> (can Clyde 

(hide-in fillng-cobinets}I 

Figure I][.1 

Even though the broken lines are not yet present, they *iu*t be put in. 

Than (CC-^OR -L'tCc jRA v ) and its Supportee? can ba removed as described In 
Sect, II.A.3, The actualization [inks Must be marked So that future 
fetches Mill know not to add the actual I zers again. This process is called 
detaching the I te*i OLCR CLYDE GRAY) from Its packet. 

This BBchanisti [which in its full generality is hOra couple*} can be 
deed for building packet® oefinetf in terms of ether packets plus certain 
exceptions, for example, AL3]^ELEPHANT-PKT may be defined as the 
ELEPHANT packet Cxt with one layer added, in which [COLOR 7^Ai_0lND- 
ELEPHANT GRAY)* is removed and [CGLflR ?&SALEIN^cLEPHANT WHITE}* is added. 



Sf ns AiB]NO-ELEPHANT CLYDE] is added to a cut, a FETCH of fCOLOP CLVDE 
CTAY] find* the following situation} 


ELEPHANT-PICT 

tDoior I ephsnt pray)* —-> I can ?dAg|ephanfc 

Chide-in f E 1 i ny—catj i net 3 ] J * 


] c-— t e I ephant-pM ———> 

??AaIbinc-elephanf) * 


I 

^EXCEPTION* 

I 

ALBINO- ) 

ELEPHANT-PKT V 

[ (color ?Ptfa I b-iirtp-B I ephant 
g.ray) *AB5ENT*1 


i 

|*EKCEPT 1 M* 

I 

] 

V 

[(can ?flilfa I b i ng-e I ophan t 

Chide-in f i ILng-cab■netsI 
*AgSEHT*J 


CURRENT 

CAT 

CallbinD-elephant-b*,t Clyde) 

Figure \\\,2 

FETCH finds (COLOR ^tfELEPHANT CHAV)*, but in attempting to find all the 
uIephant b, it notices the aEHCEPT I ON* Ginka through the ALB]NC-ELEPHANT 
packet, and leaves Clyde out. 

The 'potential ite*i F 1 tipi o-nentat i an of pgckel>C I Oeuree haa the feature 
that Et Is Independent of a particular relation Eike eet-incluaion {"IS"), 
For swamp I g „ a large implication Ilk? 

{loves 7x ?y] 

d {jealoua-of 7x ?y} n tadmirej ?k ?y] 
a {habitually (together ?w ?yf) 
a (when Ido ?* (for-some movie 

Elatibda [n> [attend ?x 7 b))}] 

(presorted ty (accoTipan i es ?lj ?k)> 

A nit 

ten be handled easily by creating a 'love-affair' packet, containing Item* 
like {JEAL0U&-DF ?tf*#LDYE3 7#PL0VtC}*, !n fact, inplication itself Is only 
incidental Iy connected with the use of packets. Any large conjunction of 




I teres sharing free variables may be represented as a packets 

If the reader is feelmg optimiStic about packets, hers are soma 
caution*. First, a-though the 'semantic baggage" problrn domin't look bo 
dad any mare, the other problems Fahlnan is concerned with are not attacked 
directly by my solution (e it* ]n particular, typical consequence* of 
somathing 1 s being an elephant are not automatically uaabla In recognition 
of elephants* Instead, elephant recognition knowledge ui II be contained 
in + say, the jungle packet, w th ths typical fra Tie organization. [This is 
as good a pi gee as any to point out that ny absurd ex ample* in this chapter 
are not representative of useful elepnant knowledge. tut only I I lustrat* 
formal problem* and solutions, I 

The biggest drawback to my system is that it relies on a packet- 
C to sura* such as the one expressing Clyde's e I eohanthood, being a temporary 
structure, frequently adeed, renamed. Or abandoned.. Otherwise, i f all 
known in fornation about all krnjn elephant* is always around. It mi I I taka 
too long to smash a potential item like fDULQR ?ttdELEPHANT GRAY!*, and most 
of the products of the effort will be useless. Nets that there is realty 
no difference between the potential item idea and a particular kind of 
consequent (]F-NEE0E0> reasoning* Thia reasoning is, "to find Clyde's 
color* find what types of colored entities are around, deduce what they 
are, and see if Clyde is one of there,, 1 This deduct ion i* done efficiently 
by following the actual]zatIbh pointer a down through layer* of Backet CCf, 
Fig* I ] 1 -2), and w* gs fc the exception me chan i *n cheap By having the system 
do it this way, but the conbi nator i 91 limitations are the same, I] believe 
that even in pad case* this net hod it superior to the other approaches 
isshtpOhed at the beginning, of this section, which Involve proceeding from 
Clyde to GQL'3H. fly approach always Sticks to rathuays which are guaranteed 
to sad to present CO..0R Items [or except ion*}, Furthermore, once they 


have been found, the potential item the! generated (ham ie gn-ns. I 

In my Ouh research an electronic design xJlctlernq11, 1974b>, this 
3 »o.guptic.n that only a feu closure? of a packet mi 11 ‘be around at one time 
■ t justified., A closure of the 'af-ampli f ier" packet Mill tie made and 
included in the ext ter a radio receiver uherv an at amplifier is nesded* 

SLrt it will not be visible in cxts for later tasks. 3 think this sort of 
Organ i 2 at I OP la typical of problem solving systems tend is crucial to the 
frame theory that most people accept], but there nay be situation a uhere it 
is inappropriate, [See Sect. V.> 

i t[.G Jmpiamen tation 

Although it is too early to tell hOM Successful my packet schEne pen 
be» 1 have i np I enentad a preliminary version pf It, the PAWACEA data Page 
nandger. ]t Is based cn a nodifietf Connivar data base with a data-dr j von 
indexer Put no sub- i Udexing, which It uses as an aesneiative memory and 
item uniquizer. It buff era the user from the actual data by smashing alt 
potential 3 tem data a? described above. 

Ttra PAMAClA system Itself is quite small, although my programming style 
has made the TDBT1 and others system programs it relies on too big and 
clursay. Ptest of the comp lenity ef PANACEA >& in the routines that maintain 
data dependencies and exception links in a consistent state. The actual 
POtsntia■-itern smasher occupies two or three payoa of LISP code. It ia 
hard to tell hou fast it runs at this tirrE, since it has not been fully 
compiled, and since there are knew inefficiencies in the Support routines 
ton I t+ The beat sure is the number of matches (uni f i cat i ons} and 
van i an 1 ! - tea ts done in the Course of a felch, This number depends on the 
complexity Of the packets involved, but i* much smaller than it would be in 


an ordinary deductive scheme. 


The Item representation E an using is that of Beyer and tlooro ’cl972> 1 
id which an i ted is stared as a pattern and separate substitution, By 
analogy with function closures, the stared substitution is called the 
"environment 11 of the i ten. The original purpose uaa to save storage and 
enhance readadr J ty of formulas by representing a deduced (omula as, a 
pattern pus oi ridings acquired during Its deductive history, Heini&vor, It 
haa the additional advantage far PANACEA that i| enables potential items to 
da actualized by just iwitching their environments before adding them* 

Unf or tunatety, this S paid for in other uayS, since special tunc 11 one have 
to be written to manipulate items Stored th'ia way. 


IV INDEXING BY CWTEXT 
TV*A Structural ]ndc*ing 


Except tor cancellation, a pnit behaves like the union Of tha ssts of 
data represented toy ita exf layer** Thera fora* it is a particularly ugly 
feature of the classical implementation tha: it finds all the data likely 
to natch a pattern and throws away the ones, not in tha current cut, instead 
of taking the union only of the relevant acts.* Excluding a layer frora the 
current C*t only naans hiding It from yourself, not frofe the data case 
machlnBry. ]t has aiuays seemed that these facts make c*ts fit only for 
toy prop lens; that domain-dependent data structures or multiple CPU’a 
would ae needed for large bases. 

■he p rob Ism is especially pr easing in view of pyr rtl I I ingnaaa in Sect* 
IU to have „any items 1 ikft ICOUM ?«ELEPHANT GHAYT* nornally hidden from 


vteu. Clearly* uc cannot afford to have every FETCH of (COLOR X ?Ci f i \ ter 
a bucket containing a formula lor every type of object h! th a usual color, 

The first alternative that cortex to mind is to rehash any inoex Sockets 
that nave gotten too big by the layers their d3ta are present |n,* On 
fetching* the system tabes each layer in the fetch ext and hashes its 
number to ratrieve the r B levant sub-bwtkats, if any* Then it unions the 
aub-bucketa. 

The problem ui th this is that moat layers will not mention any data 
matching the Current fetch pattern. Of 100 layers, most will mention tid 
COLOR 3a a or t ions, for txahple, This scnon® ia ati 11 probably an 
improve rent over the original. U te efficiency depends more on the length 
Of the ext than the busbar of patching data anywhere | n the data base, 
which ssema better,} 

[V*A,1 Hsrge History Graphs 

So far, 3 have been treating ext layer a a a pearls of data which can be 
strung as up will. This notion ie what leaves us with so little structure 
to use her b, where we need It', ]n actual practice* It ia possible to 
iflfloBe some useful discipline Oh the cuElding of exts without losing any 
reaI flexibility, 

Observe first that n me ty-n I n-e percent of all Cdhriver ext man i pu I at ion 
Is in terms of PUSH-CDMTEHI and PQR-CCflTEXT, I^USfH-CONTEXT adds a new layer 
on the front nf a ext* and FOP just gets back what you started with, tit 
s equivalent to COR*} Because uc uant to manipulate packets, let ut 
generalise PUSH to merge; take ti cuts, union then, and push a new layer 
onto the result. Merging ms w be used to include a packet-do sure in a cm*, 
(Ue are alec going to read an imp I omental i on of packet-closure exclusion. 


which I will describe later.] 

Tf we restrict ourselves to merging for the fine being, every ext has n 
parents Of irflich it is the Cfrfld. The process starts with the empty ext, 
with m la^era, called £, All one-layer exts ara etii Idren 0 * 3 (the result 
of a Perga with n*£?) . Eve-'-y layer is the first layer of exactly one cM* 
Over time, the structure of cwta Bight evolve | ik* this: 



Werse History GrfljJh 
Figyre [V.1 

Notice that there nag be bore than ons tierge qf the game tup c*ts* This 
■i... is basically irrelevant. and [will ignore it, allowing myself to say 
H the" Btrgg of two c^ts. The neu layer in each ext ie called ■ t d ptin&ry 
layer, of which it is the prfmary ext. 

The raerge history graph gives us aone structure to work with. Let us 
associate with every bucket in the i nds* its key paths, each of which is a 
(partial) descending branch through a ext graph like Fig. IV. L, (A. "key 
path" is a "path used as a key," not a vitally inportant path.. 3 A key path 
for a bucket must terminate on a e*J which is a 3 yper-c*t of the ext of 
each datum in the bucket. For example, in Fig. 1V.£, ^0, fl) , ($432 1}> 
would he a key path for a ducket of data associated with Cxts {5 4 3 £ 1), 
(11 5 4 3 2 II. and (S 7 $ 4 3 2 1). Sines a layer unambiguous I y 




identifies a i Is primary cwt, [ will abbreviate this koy oath to <0, 1, £>. 
Hitherto, ail dockets have implicitly had key path *0:» T $imcn the cwt of 
every item is a descendant Of 0. (An i ten may have more than one Oct, but 
thafe mot important.) 

Hitherto, however, such a bucket has often ocan able to grow without 
bound, N&u, when its ei£e exceeds some threshold, 1st the system rehash it 
into SUb-bucket4, each ef whose Key paths Is 1 longer than the old cna it 
tame from* For example,. a ducket may be broken down into <0, ls» + <B> 
2>, and k&. 3> sub-ouckBts. []f any of thtse is anpty, ft should be 
□kitted.] 

Npu say [P 3 } is in layer 1 and IP b) ih 2, ] will abbreviate this as 
UP aip IP bl^J, The system, can put UP aJ^l rh bucket <0, l> t put IP b 
in bucket -C0, 2>. and omit the <0, 3> bucket, Non when a {FETCH * fp 73) PS 
done in evt !& H + th* data-base manager can Ignore Soma of the {P. . , ) 
items a I together* It just "Inverts'' [6 1 i 1 C th‘< b may be dene in advance! to 
give <0, 1 + E> + and takes the <0 t 1> sub-bucket when it discovers the <0> 
bucket to have been rehashed. 

Clearly, this process Ls pxtsndaoie. Say that {IP clgn {P dj^, IP clg, 
CP fig, CP glgl are added to this bucket structure. AM nave inverted e*<t« 
starting with <0., 1,...>, so they ar* al l placed in the <E, i> sub-buckst, 
M&U a fetch of CP ?1 in c«t ID will yet [ (P aSj, (P c)g, [P d)g, {P a] ^ 

{P fig, CP 9 ] gl t all but two df which are then filtered out. This low 
ufficiercy indicates the list has gotten too big, So i t re rehashed into 
three aub-bucketsi 


Key Path 


Contents 


< 0 , 1 * 

<0, 1,, E> 
< 0 , 1 , 9 > 


IIP a) lF CP a3 ; } 
IIP clgp CP d) e l 

IIP fig, (P gjgl 


Now if the fetch is from ext ID t the system gets HP fllj, IP el;} without 


dirty filtering, ]f it i E from [£ li , 1| takes ths union of the <0. and 
<B t 1. &> sub-buckets to ggt UP alj, IP ei j + {R iU. {R dJg]_ 

So long u& only pushes ar* used to make neu evts. |he fetch algorithm 
{once you have a bucketI ig 
(a) invert the fetch cwt 

lb! uee successive prefix itrirgg nf the irweraion as key paths to find 
yewr ugy d&un the sub-bucket tree 

Cc) take the union of the 9ub-buckets found, filtering out items not 
actually In the fetch cst 

Part {OJ ui 11 proceed only go long as the as yet unseen sub-buekets 
generated by past fiODs are loo big for part Cc) to be efficient 

if merges are allowed, the algorithm la much the sake, accept inverting 
3 c(ft Mill not give a single key path. For e*anple, Irs pig, ]V*L, c«t (5 4 
3211 1 cpkb fIke 



I3i 


{£ 4 3 2 V 


Figure tV,2 


Therefore, we nuat take into account the likelihood of having more than one 
uayi to extend the key path to find sub-bucketa. 5n this case. If tha 
bucket is too big, the fe!ch*r bust look in the <0, Is-, 2 >, 3nd tff, 3> 
SUb-bUCka ta, and take the union of the reautts- Et these sUb-buckata ara 
themselves broken dacn, It.fluSt avoid finding and filtering the bucket for 
{4 3 2) twice, under Key paths <0, 2 t 4> and c0, 3 n 4>. Similarly,, the 
Sub-bucket for {£432 1) has th r, ea key paths, AI I the system h.ag to do 
ig compute the 11 i rivers ion" of [£ 4 3 2 It {again, ip advance, if desired) 


til (2) 13} 

i 

(4 3 23 

E5 4 3 2 II 

Figure !Vr3 

■nr one of the other arc-cte I et ions that turns F|g. IV,2 into a tree, Of 
count, the routine that first creates ar. anbi guous I y-nanad sub—buc ke t ruo t 
make sure that it ia painted to tram ail relevant plgcea. Than it will not 
matter which ia used to find it again. 

Before extending (.hiis concept, let be explain what its costs and 
pitfalls are, First, like any table lank-up scheme, it trades off the coat 
of adding an entry against the cost of retrieving it. 4le have to Spend 
more title and space adding an iten if we oust Inde* it by chi. However, 
memory is getting cheaper, and this schene will work with cheap masa 
Storage [Sect, IV,CK Fur thennor*,if necessary we can arrange tp expend 
the resources primarily on cuts Hike packets* which are tp be relatively 
permanent, fetched from more often than added to, 

Second, it seers as though it ecu id be a big nuisance to have to posh a 
nau layer onto a cut every title yQu Co a merge. Ifoy not just use th* old 
top layer, ■especially if you Vi doing only retrieval and no new storing? 

The answer it, you pan* | just wanted the cne-to-cne correspondence between 
layarE and cuts tor purposes of clear BKpoai ti pn, Since 3 t' g the complete 
Cwt inversion th-at guides the fetchor, it doesn't riattsr whether two c^t* 
share the tame top layer, [Of course, (up cute sharing a primary layer can 
have side effects an each other.* 

Th ird, there are "jsatlh&lepical 1 c^sss where this scheme t+i I I nnt work 
any better than the primitive union case (and practically reduces ta I t3, 



H 9 c«t is aha I Pan and wide, I ike this; 



Figur‘d ]V.4 

and its | tens are sparsely distributed anong Us layers, then «fl are back 
to unstructured unions. However, this is not a very ccmnon case, bscaus* 
packet'--1 i<s cuts tend to have been bui lt by merger, Thus the packet for 
"a I ephant" will include the samnal packet; as ul|i the oacket for "wombat.* 
]f both elephant ana wombat packet-closures are added to the current ckt T 
they uI M hot be chi Idrop of 8; 


e 



Figure IV.& 

tHare and later. I use, e,g,. "elephant" to mean the primary | agar of the 
elephant packet e^t.] In general. 3 expect there to be a tew "'utility 
packets" for a given domain {"body part," for Example, in one domain, and 
eiectrohi-c component" in another] that wilt serve as strong keys for 
pursuing sub-buckets. Even though most of them ui I I draw a blank for a 

I- 

given pattern, they will then hid* their sub-c^ts Ce.g., the cwts for 
animals or circuits whose parts they reprg&e^t] fro*i b B j ng uae< i ^gy path 
a F etienta at all. 

It nay be possible to avoid this prop I**, by artificially ensuring that 


no ext Has more than sojie jnal I number N of children. Whenever a c.vt 


acquires N+l children, the system could insert duimy layers between It and 
its children td group theo into fewer than N groups,, each with fewer than M 
jiCHbera. 3f the data are sparsely enough distributed, some of tna 
intermediate Cxte will drat, blanks ard save us some work. However. I 
predict this is mere trouule than it is north. 

A good remedy for many prablens of this kind is to store a complete 
list of the sub-buckets of a bucket and throw out the 9ub-buckets with bad 
key paths Instead of finding those with good ores. If the data are 
distributed so that they all fall into 2 or 3 sub-buckots out of a possible 
G33 keg path extensions, it is jjo.“tn looking at it this way. 

To see hou this remedy sight apply, consider the case of tho creation 
of a huge set of data patching a pattern, in just one ext that is tho 
result Of many pushes: 
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Figure EV.G 

The problen here i* that, when fetching from (4595 +- P It, the lodcxor does 
not know it i» on a ui id-goose chase unti I the last step. There are always 
lots of data ati II around down there. Or thera would be just cne bucket to 
be fiItored the eld way. 



This problen will be solved 'if ue nake a nets In the bucket for oxrt (1) 
that there is only one non-empty Bub-bucke( lor tub or thrrcS anyuhere 
bafOM It in the Graph, [f go, It is. bettor juat to keep that ducket around 
and test i ts key paths against the fetch ext. In fect T there tg no Wesson 
to build the actual graph until the nueber a 1 non-etiply buckets extends 
sortie lbresho I d, Ond then only the first coup I a of forks need to be built 
immediately. Consequentiy, ue can- modify the previous algorithm tn include 
the possibility that a bucket UBS,. sOy, four or feuer suh~buekotS, utiich 
are juat filtered a bucket at 5 time, not by individual item,, The graph 
structure exists Ohiy tn guide the indexer to this situation or tha aid 
case of one seal I bucket 3 b yet unrehaBhed. IThers is ho difficulty in 
interleaving Layers of graph and amorphous bucket Met,) For example: 
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Figure [V.7 

Hers, from til theca are eight brg buckets, too nany to examine Oh* by OhO 
(let us assume). Houever, If the fetch ext is [4993 1) , after one step 

dour* the tree there Is only One big bucket ieft to uerry about, the one for 
(BB&3 ... 13, Uhich can be compared yith 14393 *.i 1) and ignored. 







]¥,A*2 Excision g-f Cxt Layers 


To ifcplenent packst-c I usurn gxclusien,. we have to have a nay to remove 
cuts from a merge* The -may ] nave developed* although seneuhat 
unintuitive, serves this purpose Util enough, [It is 90 unintuitive that 
you may uant to skip this aectifim. If you Pish to take tiy word ter It that 
Cxt layer exciaion is Compatible with ext indexir-g.) 

A firs? guess at how such alteratione ought to work ia to generalise 
tha usual P0P-C3NTEKT operation, and el leu any layer to be removed from a 
cut, However, this turne nut t* be not as useful as the Seemingly more 
Indiract method i will describe, for two reasons. First, we want the 
operation to iBpI ament packet exclusion* Since there may be more than one 
closure of a packet in a given ext land a given layer may bo in nore than 
an* pocket) ue fluat solve the problen of hew to ‘‘undo" just ona Inclusion 
Of a packet, removing only those layers with no othor attachment to the 
ext. Second, the alteration hethod must presage the usefulness of any 
index structure that may have been pr*vi C j S |y buNt on a ext. 

Thorefore. it is OESirabl® to represent an excision In terns of 
alterations to the nerge history graph. Let us augment the uaual raeryiB 
operation with the link cptl S notation to the effect that one edge Of the 
g-aph ie to be invisible in the result of the merge and c*te later derived 


from it: 


in Fi 0 . iV,S r 
been deleted, 
effect. ' 


In Fig* ]V + 3, 
i b another I irk 
excision of "* 
Sometimes 


(U 



Figure IV.a 

layer 7 tiaa been pushed onto tS £ 4 3 2 L) . and layer Z baa 
However* deleting a link doesn't aluaya have any Immediate 



Figure ]V.9 

(ayar 1 cannot oe flushed from ext {5 4 3 2 15 * because thane 
It to It from (4 3 3 IS. But 16 5 4 3 2J* (jonnrated by 

i " has lost iayer 1, 

de(ating a link r&soves tiore than one iayeri 


u> 


C2 1) (3 11 



IG S 2 LI 
Figure 3V„ 

Here, snapping the t i nk renoves the last tie to fayer 3 as wail ae layer 4, 
Unfortunately, there day de mdre than one path to a H nk (which 
corresponds tp ita dting part of a twice- i PC i udad packet). In thie ease we 
feuet spec t f y which Ohs He mEan: 
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(3 a 7 S 5 4 3: 2 1) 

Figure IV+11 

Hsrs, ue mean to delete one "pecurronoa" of the link H * t h as det#r*imad by 
pts subsequent history. ThE cut It specified by its target link and 
the path taken to it h Since there remain* another path* layers 3, 2, and 1 
remal ru 

in general, the rule is that tutting a link during a bergo causes the 
excision Df al f layers whose prlaiary exts can no longer he reached from the 
result of the merge, A c*t cannot de reached only if every path to every 
link immediately be:on it is put by snmg nod? on the path. 




in Fig- [Y*L2, there are four paths to []nk "*" froit (B 7.. + L)« 
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Figure [Y.12 

Only two of the paths are cut in producing i9&7G54221), t0 layer 1 
regains in the nerge. 

3* a cut diagrats ieaves a path anbiguity, iry contention la that all 
paths to thE cut link are ainu!tanecuaIy flushed, 

hare ie another eaarrole of a merged and cut graph: 
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A EB 7 S 4 2 11 \ 

A 2^way ^ \ 

Ambiguous Link \ 

(3 7 S S 4 J 1) 

Fituire [V, 13 

The reader should convince hi nee If of the correct ness, given the aarge 

Mstory graph, Of each pf the Cxts in Fig. 3V. 13. 

S 

Now that link cult 1 rig ib underload* [ gm in 3 position to oa scribe Its 
advantages in rr.pre detail First, It H complete, in the sense that a c*t 
with 3ny set of layer* can be generated by a mergo with appropriate cute. 
Namely* merge the primary cut* oF each lager, with a out of every link 
between cwtst 



(7 G 5 4 3 2 l) 


Figure EV.14 


Second* as promised* link-cutting i; a natural implementation of packet 




exclusion* exanple, say Clyde and Benny are elephanle, and every 

elephant ha& an LTUSK and an ^TUSK. 
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CIydel 


figure [VI-15 

Tne papke t>eloaurE [TU£K-PK7 {RTU5K CLVOEI ) can be renovEd by cutting 
the graph this uay: 



Figure IV. 16 


The third and earn advantage I s that me can uaE the same Indexing 
strategy as for uncut graoha. AM ue have to be atMe to do i& invert a cut 
ufcth acne Cut branches* ]n itflny case such as Fig, [V,iS, ttiie is trivial. 




slrtcB all we have do^g is prune some redunaanl branches^ 

Problems arise with cxta like (10 E 45 of Fig, IV. 13, Inverted, Its 
graph 1* 

0 

.!> 

12,11 (3.1? 

14 3j2 1) 

15 4 2 II 

I 

(10 5 41 
Figure ]V,17 

uhtr* the dotted areas a^g not there any acre, On fetching from this cM, 
the SyStsn needs a new w3y to attend the key path <B>. 

The simplest approach is do retrieval the same as before, but to Ignore 
the sub-buckets with key paths *8, l>, <0, l i 2>, and <3 h 1, 3> lif they 
have been rehashed)* and just use them to get to th* tfl, L, 2 t 4> sub- 
bucket- A eo lot ion like till a nay be painful if we have to follow a long 
■'phantom 1 trail like this* only to find an empty bucket at the end. But 
there it no reason not to hang the c&, 1,2, 4* sup-bucket directly from 
the <B> bucket for future reference, and give it the new key path *0, 4>, 
Cut (4 3 2 II decomas a "pseudo-chrld B of 3, The link from the <0> bucket 
tb the <B, 4* sub-bucket is only UBed far sub-e*ts of lit £ 4] and ite ilk, 
(Notice that the sygtem must du this for every bucket in the Inde* rghashed 

an the** extg; having organized thi (COLOR_ \ bucket this way will not 

help ua on a fetch bf fSEZE,*,L) 

The Implementation of cut exclusion is compatible with the trick of 
associating with every bucket a list of Its suO-buckete, to be used! to 
Spaed up searches for isolated sub-buckets. (Sect, fV.ft.U Pseudo-chi I a 
I inke dp not altar the aub-bucket ilsl, tut merely shorten the pa the to 


sorTE sub-buCkete in ectie c i rcum glances r 


IV.A.3 Canesl latian 

So far, I have treated the cancellation of a cmarker (Sect, II. A. 2 ] as 
an infrequent occurrence It (b handled by the stl I f-mcMeary c«t 
filtering step that it dore an tha final output from the inde*. This is 
prater it OMy a feu data are car-col l?a in a pa r t icullar c^t. 

However, there may bo cases in which cancellation ie *p frequent that 
ttie Syaten ] havd outlined works less; wall. {] t can't do gorge than th» 
current algorithm.) For example, aasune again that c*ta rare being uaed to 
(node! successive situations, whose approkinate time ie represented a» i temH 
of farm [TINE £)„ A sequence might look like thrsi 

-CXT1- Cm. - CXT3 -CXT4 -- 

(time -'—X (time attfrrnopn) —-> 

siorni ngk I tine --X 

lunch! 

Figure 1V. 1-5 

The X’s mark the cancellation of an I ten, Each successive cut ie formed by 
tJUBhing the previous pus. tflrtBequenfEg, a FETCH of ETlhE 7P in CXT4 mil 
union the sdl^&uekets of tha «T|n£. CAR» bucket tor CKT'i 1. 2 t 3, and 4. 
Only one of tha three items found ml M survive th fl test for canes I I atl on. 

By making the string of CMS and items longer* the efficiency can be made 
arbitrarily fou, 

fiy feeling ie that in practice this probien will net be serious, 

I 

because the only exts that get fat enough to require Sub-indexing are the 
more or lets permanent ones that are planned too darefuly to be the result 
Of a long string of esnea I lat i ong. A a ditcussed rp cHcOer :»a 1 1, 1974a>. 
there is a need to reorganize and epitonize c«t sequences periodically if 



they arcs to represent long stretches- at Hue in -i ygcful May. During such 
u summary, the CSncci lations tend to be left Out* 

However* for Curious purist? uho thifik it important that an algorithm 
a I uays «drk* there sre wayg of avoiding sons qf the trouble, If you think 
of canee I I at I Oh as a complement operation super i tipoocd on the union of sub- 
buCketS for different layers, the problen is that taking one big set from 
another can giver- an 'irritatingly small return from a large investment 
(uhilo an expensive union is a I May a uar tfi i t'l , Once the system has core 
such a Complementation* however, it might as us I I trg to save the re-aul t 
for future reference. For example, in ths tcTir^u CAfl» bucket for CXT«i„ 
ue can (after one reference! nark, the sub-bucket ( [TsFlE AFTEflNDDNt I as 
"complete*" meaning that IPS buckets above it should Oe ignored, not 
unioned and filtered. The problem Ml th this asproach is that BDlne 
bookkeeping is required to unnark it if an item is added or removed trpM 3 
super-bucket, [Qne scheme; every bucket could be narked uith its "time of 
coapletl&n"; a complete bucket u&uld he accepted only if Its time of 
completion Has after the completion- times of all its superiors.) 

IV,B Interaction uith the TDEH ]ndexir 

[ have referred vaguely to the rehashing of certain "buckets" into 
"eub-byckets" bj ext structure.* 3n Sect, I ], L tteecriced an independent 
duck s t - sub-duck e t system based on pattern features. It has to be decided 
hOH these systems should interact. 

Logically, thBre is no particular problem, each step in a FETCH, 
the indexer has a bucket and a rercaining set qf features and exts that it 
hasn’t used yet, If the bucket is i ran list, ll takes that* If It ie 
broken down by features, it usss the features it has; if by cm t, it go-es 


another step douri the inverted fetch cut. A Her each step, it ha a a new 
bucket and ; £SS regaining key material* 

The problem is purely strategic, SJhon an averaized bucket need a to ba 
rehashed-, there la a Choice as to which □ f the regaining keys to rehash an. 
For example, if (TETCH ' (P A ?}) findE too many itema, should it rehash tha 
*F, CAH> bucket by at on* or by e*ts? 

Notice that no matter how ien .9 -a bucket is, it ia "too long 1 * under Juat 
two kinds of circumstance: either the exf f liter ing stap or the natch 

filtering step rejected too large 3 fraction Cf its inputs Tha proper 

rehashing strategy, therefore, is to rehash on the key corresponding to 
which of the two problems actually occurred. Thus, if there are twenty 
items in the «P, CAR>> bucket, but only one is in the fetch c*t 14 II, the- 

eyetei* ahould rehash by ext end take the <cP, CAH>, 1> aub-buckst. If 19 

af the Hens are in the Cxt, but only bne or tuo match (P A ?)„ It should 

rehash by features, Phd take the ^cP, CAR*. <A, CAl!]£l» aub-buckat. 

Of course, there ire exceptions. If the ext filtering atep H a 
inefficiency ia due to a lot of cancel ed items, different remediea are 
called for* (See Sect. IV.A.3.] af a bucket is already rehashed, as is the 
«P h CAR>, <P + CARj> Sub-bucket of Fig. 11.4, there Is no point in 
rehashing by features. The inefficiency; can happen in thia oaaa only If 
the fetch pattern ia something like IP ?K ?X.) and there ana tco many i tema 

of the fore EP si 52) with sj ps£ h [J do not know if therB is a way to 

handle this case efficiently, cr hou important it ie.i 

With this schema, the aystem can start out with one bucket for all 
data, with key <>, it treats this top-level bucket the way it would any 
other, rehashing It by relevant key whan it gets too big. 


iv,c Secondary Storage 


An advantage of the Cwt indexing stherre over the jjura cwt filter imp of 
thi TDBH is mat it appear*, to be compatible with efficient use of a Inti, 
nan-random-access atoragp, Because ue do not actually have to. see Items 
not ill the current Gwt, they can be judged! out onto a disk or drum. 

Houser, use of secondary storage util require development 9 # a record 
1/D system for handling i ten data. It ui N be impossible to uae LISP 
addrauaes, sine® U3P h s address apace is so small [and -since Host LISFs 
MhH a me9S &f ^scondary storage use)* In particular, [ doubt that syahcls 
can be stored as U&P-like atom*, uni ass there is a largo □ parr ay-1:1 ho hdah 
table on the disk for simulating LJSP's obarray. [Prtsurr.aa I y t the 
analogous "uniquizlng* function far Item, called □ATUH in Cotinlver* M Hi 
have to bo foregone, since the ayaterr can’t afford to Search the disk, for a 
variant of a new iten.J 

So there are tun approaches ace tan take: either wait tor a LISP ui th a 
large, efficiently organised address space. or debug the following hack: 

Assume symbol* are represented ay Character strings. The system nil! 
have to use USP'a B£AQ (Or a streamlined version) to read I terns from thu 
diak. Of course, we do not have to specify all of the characters lb 3m 
item's printed representation, but only these left unspecified by the hoys 
to the ducket the characters were found in* For axanple, if the «P , CAR» 
bucket of Fiy. 11.4 uere written to the diek, it could be stored as I [at , 
th) , (cl T (dJ , tel) (plus c*t markers]. Furthermore, the system can save 
time by postponing the actual JEADing of characters uhen they are drought 
in, until they are needad to reconstitute a bucket. That is, there Is a 
buffer step between aaking for a bucket and reading from the diak: 


asks for auks for 

INDEX-FETCH <---— SUFFER -DISK 

bucket record 

» c -Indicate* data flaw 

Figure ]V,1S 

The moat crucial requirement tar such a system is that It respect 
locality of reference with respect to ext. That 5 a, whan on* bucket for a 
Oft Lo brought in f other buckets for different features In the samg ext 
ehduib be stored in the sane records* This carv be achieved by keeping a 
list of alt buckets and sup-indexes directly associated with a ext layer,, 
and writing them out together* flhl s will be Cheaper if buckatB are broken 
down by ext defers feature-*] C U Sect, IV.B,I Since the indexer ntefJs the 
euO-buckeU for alp super-cxts when FETCH i ng from a ext* a FETCH from a 
long-neglected owt might cause several whole branchHB of a sub-Index to 
brought In from disk. 
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In Fig. EY.2ET, the items thwin are written out in order fP at, IP bl + CQ 
cl . 4D cl , IP el, IQ sl a (Q f ],...» A reference to (P 73 in exf U) uil I 
anticipate a raferenc* td IQ ?> by bringing IQ cl In. 

The complexity of the issues regarding secondary-storage management is 
the nein obstacle to implementing the ext indexing aysten ] have described, 
Uithout taking secondary storage into account, an implementation would 
Oogin tp thrash, from external reasons {limitations Of PDR-lfl’s and LISPbI 






before 1 ts advantages began to let L 


V 50 WHAT 7 

“The reader should toy new be eenvincad that it is possible to Implement 
large P I anner-type data bases ef f leiwnt ly h if they can be uie I I organized, 
The time Required to fetch a pattern trotn a ext depends pn the size of the 
pattern and the length of the Cxt, and to 3 snail decree on the number Df 
C.eaoly related data that might seriously confuse 3 mora Standard system. 

It uou | cf ba nice if i could prove a theorem regarding the average cost 
In time and space of using a systen like the one [ have described, but I 
tack the expertise to do it- [ find it hard to inagtne cases in which its 
behavior would he unacceptable,, but I Bl* probably over looking sone peculiar 
interacticnj, Host of the Interactions 1 see are actually beneficial. Far 
example, the mcrE features in a pattern., the more sub- inheres ta I opk 
through, if there ere o lot of data. On the Other hand, uhen a pattern has 
a lot Of features, there aro unlikely to be near duplicates of It in other 
cats, so the bucket tnat is ultimately found will probably not be rehashed 
by cm t at all. Sc pErfprrehc* ia probably seldom degraded on both typES of 
key. I cart’t prove to what degree this Is true, however. 

i he ease is even more confused with my packet 1 Kp I Ohentat i on, uhich 
reliee on the organizing power pf the frane concept* Here what fe needed 
I* experience, uhich [ hope we ui I I all soon have mere than enough of* The 
most pressing question wft should bs asking ls f hew well cues b program have 
to organize itself to avoid choking cn irrelevant packets qr frames? 

Bruts-force efficiency is thus a subordinate issue. A mare interesting 


cause I would like to '‘ally to is the defense of Scott Fahltiari'e sarly 
paper "A Hypothe* iS-Frflne System for Recognition Problems' c!973> from the 
critique In hie recent "A System far Represent Ing and Using Real-World 
Knowledge." <1975;» The f ■ f-* t paper uas an elegant e*.oOa | t i on of tho 
develop,fig frarte theory. It is const Stent with the work of Hinaky <t974>, 
Kuipcrs d375:*, kiincgrad *1974^, RuOln <137£>, Harc.g-3 ^1975?, and floor e and 
Nowell <J.973>, most of uhori plain to be working on franes. People who 
admire all this research can sea hou the mechanisms ] have described could 
be helpful. 

_r may-Oe they cgn' t, One problem uith ny -tJtpoa I t ion that makes it look 
different fr&ii previous franre theories is that [ haven’t menrioriBtf 
recognition at all. Host of the theories are about nothing but 
recognition. The compotatiana I problems all revfl|ve around guessing a 
frame to account for data* filling in tnat frame uith data,, proposing 
SUbframea, building super-frases, transforming to neu frames, and S 3 on. 

The only apparent purpose of a frame is to embed it in a more inclusive 
frana. 

Probably this is a minor oversight. We are expected to- see by 
■oijreelvGS the nany advantages of finding an instance of a Sarye "almost 
right" chunk of data like a frame. For cwamplo, after a mEdicai diagnosis 
^Rypin, 1975>, d doctor program has a structure of frarses representing a 
hypothesized constellation of diseases. Row does it know whal treatment to 
prescribe? Presumably, there is a frame slot TREATMENT - AMPUTATION, Dr 
something, which oan j ua t Oe read off. 

Indeed, 'just reading it off" seeds to be the retrieval mechanism frame 
theorists long for. A frame represents a body of knowledge that id we-H 
understood. "A frame is a specialist In a sma I 3 domain.* ncKuipera, 1975, 
p, 1B2* This is why typical frane contents tend to be slots and values. 


with all Other information represented as auxiliary procedures, [ndbad* 
the alGt-value system reminds me Of nothing so r»ueh as a called function 
with all its ^ar i ah I es bound [with son* $lol& being bode I! led as cc t i on a I 
arguments with default: values). I[uipers p frah® implementation ie just like 
this. His frames are processes uh i ch coroiiyri icatc by sendirg each other 
requesta for slot values and the like. 

All of thia concentration on slots, values* and Hessasrb passing make a 
rtns unhappy, Uhy did all these people abandon the Planner-type data base? 
There is no real difference conceptually in how a Pfanner-type index stores 
slot-value pairs* Clearly* if all assertions are of the form ICDbDR REDI 
or [TREA itlENT AiPUTATIQI^) * it doasn’t matter whether they are stored in an 
a-liet or a e*t* The problem lies in represorttIhg more complex pieces pf 
information, Tike 'the way to spot a gngrd Is that hi a aocka don’t natch . m 
Halt, this is a problem: it might be £lre AS problem. 

Let me suggest that a rrost important feature ef a representation of¬ 
fsets like this is that it be obvious Mhflt the fact is and why it is 
bo I ievB-d and ursat might have been believed Ih its place and how it ndghfc 
have been used* as well as hou it fj being used. Unless a program is such 
an expert that it knows* by reading off a slot, what to dp in any 
oooceivab > e situation* someday it is goang to fail to solve a problem and 
have to dtbug its world model* [Cf. tSussnan, l97£x] ]t Hill have to start 
from the debris of its previous efforts. If this dibrii consists only of 
prpgrau counters ef p«- 0[:e g EeE confused Sy Conflicting slots,, It will! bo 
useless. A program that relies eon*:? I m b ly op passing messages of 
anticipated form can have cnly unpleasant surprises, intelligent people 
are pleasantly surprised by pros 1 ®n e- all the tins, 

Anyway., packets and c*t& can support all the usual ways of doing 
recognition: let the data suggest a frame: fbdue attention by activating 


an '':J S frames at a Ei-mei rely on ’‘berncrm" I i f-added interrupts, for 
aK^mpIc-t to model "noticing"; let the frame:: guide the search; allow 
default facts which are easily replaced. Hopefully they m i 31 support 
whatever Other functions francs are going to haws, 

Fati I man" a recent paper., "A System for Representing and Using- Real-Ucrld 
Knowledge* tl375> repudiates such of this frame tradition, and advocates 
tPa use of large parallel networks of 'concept nodes" for storing 
information and doing recognition of unfamiliar objects. Fahiman claims In 
this paper that relevant sunset* Of a large set of data cannot be 
efficiently retrieved from a typical ft] data Structure; if ‘relevance" can 
Pt modelled using cuts tor framas}, 3 an confident this claim is tslacu 
Hi* other -claim s that the ucual frame-tneoretic approaches to recognition 
I mentioned in the preceding paragraph are not a) powerful as doing 
para I lei interiactions of large sets of nodes, each represanting the 
concepts uith a given property, in orc-sr to suggest a familiar concept that 
might share -al O' 1 a set pf properties. These i nter sect ions, uhich reguirs 
special-purpose parallel hardware, can be made to solve Other well-known 
problems. Fbr e*au,ple H disambiguation of co-occurrent terms like "pitcher" 
and 'diamond 1 can be accomp I i shed by intersecting their sets of possibly 
cunts *t s. 

This propo-ssl stand* In direct opposition to the usual mechanisms 
proposed by frame theorists to do these tasks. Each theory makes different 
assumptions about where the cdfiplcKity iesj, The fundamsnta1 assumption of 
frame theory I* that a problem solver should nee t a difficulty by finding 
an organized body of knowledge that applli* to it. “or example, Marcus’s 
<L37S> 'uaif-and-sse parser' use a- knowledge eeout Choosing between parses 
to avoid having to back up. It doesn't have problems requiring: large 
intersections for solution, because It aluay* phrases Sts recognition 


praqIrn terns which it knout are correct, 

k Imnguegg system knows that "Fido" ]& a noun, but it also knows, under 
what cirow*stances to do pat tern-rial eh i ng on 'Fide' [say, in the 
phonology), and when to call it a noun (in the syntax). It would ba 
pose ip Is to do FahI manesqae "barker sweeps" to implement detion calls 
triggered by struetu-ea the eye tom builds. but, given these guide linea. 
which are neeoed o n Indeoenaent Einguiatic grounds, there are cheaper ways. 

Another exanr.ola come a frori iJflitz'a <1373> research on vision of scenes 
with shadows, Hia method would aeefl to benefit greatly fron parallel: 
marker sweepa, until closer examination reveals that enough single 

knowledge has bean draught to bear cn fhs problen to ngke tha par a 1 II e I i sm 
upriB cs s E-ary. 

]t seems at present that the evidence favpra frane theory's account of 
ti‘is tOIrp I oh 3 1 i as of ihte- Iigenet, but the issues are too COhplax far a 

verdict non* At any rate, cuts case efficiency dots not seem ts b* the 
i ssu-a, 


A ckcoivT e tfynTB 0 ts -- 

Th*5 paper is the result of conversations with Scott Fahlman, Bop 
rloore, and David Harr * The ext indexing scheme of Sod. [Visa descendant 
of iff core's solution to the Symbol“ napping problem. Gerry Susaman and 
others as referenced are responsible for noet u I omenta of the TDBfli 
Suaaman, Ronald Rivest, and others HSda helpful suggestions far improving 
early drafts qf this paper. 
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